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1.0  INTRODUCTION  AND  BACKGROUND 


x  1.1  INTRODUCTION 

^  This  paper  Is  Intended  as  a  primer  on  Image  compression  techniques 
as  they  apply  to  compressing  Synthetic  Aperture  Radar  (SAR)  Imagery. 
Particular  attention  will  be  focused  on  two  techniques  which  are 
thought  to  be  particularly  suitable  to  this  particular  application 
because  of  their  simplicity  and  speed,  namely  Vector  Quantization  (VQ) 
and  Block  Truncation  Coding  (BTC).  More  well-known  techniques  such  as 
Differential  Pulse  Code  Modulation  (DPCM)  and  Discrete  Cosine  Trans¬ 
form  (DCT)  will  also  be  discussed.  Recommendations  for  experiments  on 
SAR  Imagery  are  made.j 

1.2  UNIQUE  ASPECTS  OF  SAR 

^  SAR  Imagery  differs  from  conventional  visual  band  photographic  or 
electro-optical  Imagery  In  two  significant  respects:  first  by  having  a 
large  dynamic  range  (as  great  as  96  dB),  and  second,  by  the  presence 
of  speckle  noise,  the  sort  of  noise  seen  In  optical  holography, (Ref 's 
7-1-8).  Since  SAR  Images  are  produced  by  coherent  processing  of  x,- 
scatterlng-type  signal  reflections,  speckle  effects  are  produced  by 
what  has  been  described  as  Intra-pixel  Interferometry:  the  coherent 
signal  Interaction  between  separate  scatterers  within  a  .Ingle  pixel. 

This  speckle  phenomenon  Is  always  present  and  varies  from  pulse  to 
pulse  for  a  given  pixel.  Speckle  contains  Information  pertaining  to 

the  size  and  location  of  terrain  scatterers  and  perhaps  to  sub-pixel 
textural  Information  (Tomlyasu  [8]),  but  for  most  common  SAR  imaging 
applications  this  Information  Is  of  no  concern,  and  In  fact  can  be 
described  as  signal -dependent  (multiplicative)  noise.  In  holography 
this  degradation  has  been  estimated  as  causing  a  reduction  In  resolu¬ 
tion  such  that  an  aperture  2.6  times  as  large  as  that  required  for  a 
speckle-free  Image  would  be  necessary  for  equivalent  resolution 
(Kozma,  [2]).  Speckle  noise  likely  contributes  to  a  reduction  In 
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adjacent  cell  correlation  according  to  a  Rayleigh  model  described  by 
Wu  [3].  Spatial  lowpass  filtering  can  be  used  to  smooth  the  speckle 
noise,  but  edge  sharpness  Is  reduced  by  such  filtering.  Multiple-look 
averaging  will  Improve  the  Image  quality  without  blurring  edges,  but 
at  the  expense  of  requiring  collection  of  the  additional  Information. 
Numerous  approaches  to  the  speckle-reduction  problem  have  been  stud¬ 
ied.  Llm  and  Nawab  [4]  observed  that  grey  scale  modification  could  be 
used  to  Improve  the  Image.  This  Is  of  note  since  grey  scale  modifica¬ 
tion  represents  a  dynamic  range  adjustment  (ORA);  thus  It  may  be 
expected  that  a  logarithmic  type  of  ORA  will  decrease  the  speckle 
effect.  The  same  authors  found  some  Improvement  using  homomorphic 
filtering  such  as  described  by  Oppenhelm  [14].  A  homomorphic  Wiener 
filter  was  found  effective  by  Jain  and  Christensen  [6],  Tur,  Chin, 
and  Goodman  [7]  have  observed  that  the  multiplicative  noise  model  Is 
not  always  valid  for  speckle,  specifically,  when  the  object  Imaged 
contains  fine  details  which  cannot  be  resolved  by  the  Imaging  system. 
Lee  [9]  has  proposed  a  filtering  approach  called  Sigma  Filtering,  In 
which  each  pixel  Is  replaced  by  an  average  of  those  neighboring  pixels 
which  have  their  amplitude  levels  within  two  standard  deviations  of 
that  pixel.  Lee's  approach  minimizes  degradation  of  fine  detail,  as 
does  the  recent  approach  of  Crlmmorts  called  Geometric  Filtering  [10], 
[11].  Geometric  Filtering  Is  an  Iterative  non-linear  geometric  ap¬ 
proach  using  the  convex  hull  description  of  a  pixel  set.  Crlmmons 
defines  a  Speckle  Index  as  the  ratio  of  deviation  to  mean  of  a  block 
of  pixels,  and  demonstrates  superior  reduction  of  speckle  so  defined 
using  geometric  filtering  as  compared  with  look  averaging. 

Speckle  reduction  Is  considered  relevant  to  Image  compression 
because!  (1)  In  the  case  of  the  Discrete  Cosine  Transform  (DCT), 
speckle  very  likely  causes  the  algorithm  to  assign  bits  to  transform 
domain  regions  (such  as  the  high  spatial  frequencies)  which  typically 
should  receive  very  few  bits,  thus  forcing  an  unwanted  reallocation  of 
bits;  (2)  In  the  case  of  Block  Truncation  Coding  (BTC),  the  algorithm 
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codes  and  transmits  block  standard  deviation,  and  since  speckle  repre¬ 
sents  a  non  Information-bearing  component  (for  typical  exploitation 
applications)  which  contributes  to  the  block  var'^ee,  It  may  well 
contribute  to  the  coding  of  unnecessary  bits  for  the  block  variance. 


3 


2.0  COMPRESSION  AND  CODING 


General  discussions  on  compression  are  contained  In  References 
[12-15].  Brief  discussions  on  Remapping  and  Log  compression  are  given 
below,  followed  by  descriptions  of  Differential  Pulse  Code  Modulation 
(DPCM) ,  Transform  Domain  approaches  such  as  the  Discrete  Cosine  Trans¬ 
form  (DCT) ,  and  finally  by  discussions  of  Block  Truncation  Coding 
(BTC)  and  Vector  Quantization  (VQ). 

2.L  REMAPPING 

Remapping  techniques  are  those  which  can  be  Implemented  by  the  use 
of  simple  lookup  tables  which  map  one  Image  pixel  amplitude  Into 
another  amplitude  on  the  basis  of  Input  amplitude.  Strictly  speaking, 
remapping  may  or  may  not  result  In  a  compression  of  the  number  of  bits 
per  pixel,  but  may  simply  expand  or  contract  the  r^nge  of  output  value 
for  a  corresponding  Input  range  In  order  to  Improve  viewing  or  to 
match  the  characteristics  of  a  display  device  (Matthews,  et  al  [16]). 
This  Is  sometimes  performed  to  expand  the  contrast  over  a  desired 
range  of  amplitudes  where  desired  features  are  felt  to  lie,  and  Is 
then  referred  to  as  Dynamic  Range  Adjustment  (DRA).  If  the  output 
range  Is  re-quantized  at  fewer  bits  per  pixel,  a  data  compression 
results.  Llpes  [17 J  reports  on  compression  of  SAR  using  various 
quantizations.  The  quantizing  Is  optimally  done  so  as  to  result  In 
minimum  average  error  In  the  quantized  signal  by  apportioning  the 
quantizing  ranges  as  described  by  Max  [18]. 

2.2  LOG  COMPRESSION 

.  A  particular  remapping,  the  log  transform,  has  been  recommended 
for  compression  of  SAR  Imagery  by  Crowe,  et  al  [19].  An  actual  exam¬ 
ple  of  such  a  remapping  used  to  achieve  compression  Is  the  log  com¬ 
pression  presently  In  use  In  the  ASARS,  In  which  a  96  dB  dynamic  range 
has  been  achieved  by  Incorporating  a  scaling  parameter  derived  from 
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the  system  AGC  Into  the  log  compression.  Informal  experiments  by  the 
ASARS  contractor  have  resulted  In  pleasing  results  from  using  a  simple 
square-root  compression  having  only  48  dB  of  dynamic  range. 

2.3  SPATIAL-DOMAIN  ENCODING 

Differential  Pulse  Code  Modulation  (DPCM)  Is  a  compression  tech¬ 
nique  that  has  been  subject  to  much  examination  in  the  literature,  and 
thus  will  only  be  dealt  with  briefly  here.  Lynch  [12]  and  Oppenhelm 
[14]  provide  good  general  descriptions.  DPCM  algorithms  achieve  com¬ 
pression  by  employing  an  embellishment  on  what  Is  called  "delta  modu¬ 
lation",  where  only  a  one-bit  estimate  (slope  up  or  slope  down  esti¬ 
mate  of)  the  difference  between  adjacent  pixels  rather  than  their 
absolute  value  Is  transmitted.  In  the  case  of  DPCM,  a  predictor  Is 
employed  to  make  an  estimate  of  an  adjacent  pixel  amplitude  based  on 
the  value  of  one  or  more  of  Its  neighbors.  Identical  predictor  equa¬ 
tions  are  employed  at  both  the  sending  and  receiving  ends,  thus  making 
It  possible  to  store  or  transmit  only  the  difference  between  the 
actual  and  predicted  values.  Considering  a  spatial  matrix  of  pixels 
with  a  single  pixel  designated  as  x(m,n),  a  typical  linear  predictor 
equation  might  have  the  forms 

x(m,  n)  ■  +0.75*x(m,  n  -  1)  +  0.75*x(m  -1,  n)  -  0.5*x(m  -  1,  n  -  1) 

If  all  three  adjacent  pixels  (left,  above,  and  above-left)  were  equal, 
the  predicted  value  would  be  equal  to  that  value,  whereas  If  the 
"above-left"  pixel  were  low  In  amplitude,  the  predicted  value  would  be 
correspondingly  higher  since  the  three  pixels  would  appear  to  define  a 
plane  sloping  upwards  toward  the  pixel  to  be  predicted.  Although  the 
coefficient  terms  above  add  to  unity,  this  need  not  be  the  case  In 
general.  Calculation  of  predictor  coefficients  based  on  the  statis¬ 
tics  of  the  Imagery  will  result  In  appropriate  coefficients  for  that 
Imagery.  Subtracting  the  predicted  value  from  the  actual  pixel  value 
at  the  point  of  the  original  Image  leaves  an  error  term  which  can  be 
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quantized  Into  fewer  bits  than  the  original  pixels  and  then  transmit¬ 
ted  or  stored.  Practical  DPCM  systems  typically  achieve  average  com¬ 
pressed  quantization  levels  of  between  4  and  S  bits  per  pixel.  The 
design  of  the  quantizer  lookup  tables  should  be  done  according  to  the 
nature  of  the  type  of  Imagery  to  be  transmitted.  The  error  signal  of 
SAR  will  most  likely  have  a  different  probability  distribution  than 
the  corresponding  error  signal  for  photographic  or  IR  imagery.  An 
optimum  quantizer  will  have  Its  quantization  ranges  apportioned  so  as 
to  minimize  the  overall  error  probability  on  the  average;  different 
Quantizer  Tables  should  thus  be  employed  according  to  the  class  of 
Imagery.  OPCM  systems  designed  In  two  dimensions  such  as  the  above 
equation  are  often  described  as  2D-DPCM  to  distinguish  them  from 
single  dimension  or  temporal  schemes  such  as  used  with  time  waveforms 
like  speech.  Adaptive  OPCM  (AOPCM)  applies  to  systems  which  adapt  the 
predictor  or  the  quantizer  to  the  statistics  of  the  Input  image  on  a 
block  basis.  Practical  OPCM  systems  typically  achieve  average  com¬ 
pressed  quantized  levels  of  between  4  and  5  bits  per  pixel,  although 
Werness  [20]  has  described  a  dual  rate  predictor  (AOPCM)  which 
achieves  coding  rates  ranging  between  1.3  and  2.3  bits  per  pixel  for 
SAR  data. 

2.4  BLOCK  COOING  * 

Block  coding  of  Images  Includes  those  coding  or  compression  tech¬ 
niques  which  operate  on  blocks  of  data  samples,  typically  square  in 
the  case  of  Imagery.  Such  algorithms  Include  the  transform  techniques 
such  as  OCT,  and  quantization  techniques  such  as  Block  Truncation 
Coding  (BTC),  and  Vector  Quantizing  (VQ).  These  algorithms  typically 
process  each  block  differently  according  to  the  local  statistics  of 
each  block,  so  a  common  concern  among  them  Is  the  occurrence  of  visi¬ 
ble  block  boundaries  In  some  processing  situations  and  at  larger  block 
sizes  (say  16x16  blocks  as  opposed  to  4x4  blocks).  Hablbl  [21]  and 
Gonsalves  [22]  provide  comparisons  of  block  techniques  with  OPCM. 
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2.4.1  Transform-Domain  Techniques 


Transform  domain  technlquas  (Wlntz,  [23])  In  general  achieve  their 
effectiveness  by  taking  advantage  of  the  fact  that  the  Information 
contained  in  some  of  the  transformed  values  of  the  Image  Is  greater 
than  In  others.  That  Is,  transforming  a  spatial  Image  using  a  common 
transformation  such  as  the  Discrete  Fourier  Transform,  one  obtains  an 
orthogonal  or  uncorrelated  set  of  coefficient  values  In  the  spatial 
frequency  domain?  the  Information  contained  In  the  high  spatial  fre¬ 
quency  coefficients  has  been  shown  to  be  of  less  value  than  those  of 
lower  frequency.  It  Is  thus  possible  to  either  dispose  of  the  less 
Important  coefficients,  or  assign  fewer  bits  to  them  in  an  encoding 
scheme  prior  to  storage  or  transmission.  A  number  of  transforms  have 
been  Investigated  over  the  years,  and  It  has  been  shown  that  the  opti¬ 
mal  orthogonal  transform  from  a  coefficient  truncation  mean-square 
error  (MSE)  sense  Is  the  Karhunen-Loeve  (K-L)  transform.  However,  no 
efficient  algorithm  exists  for  computing  the  K-L  transform.  Given 
this  fact,  other  transforms  which  do  have  "fast"  algorithms  have  been 
Investigated,  Including  the  Haar,  Slant,  Walsh,  and  the  DFT/FFT.  It 
Is  generally  concluded  that,  for  Harkov  sources,  the  DCT  gives  the  MSE 
performance  closest  to  the  optimum  K-L  results,  as  described  by  Ahmed, 
et  al  [24].  Gloutsos  and  Werness  [25]  have  found  that  In  the  case  of 
SAR  Imagery,  the  Sine  transform  rather  than  the  Cosine  transform  Is 
optimum.  It  should  be  noted  that  the  MSE  measure  of  performance  Is 
commonly  used  because  It  Is  objective  and  can  be  easily  be  measured? 
however,  It  Is  a  plxel-orlentud  measure,  and  may  not  be  truly  Indica¬ 
tive  of  the  actual  spatial  feature  distortion.  Werness  [15]  has  pro¬ 
posed  a  set  of  metrics  for  evaluating  the  performance  of  compression 
schemes  that  take  Into  account  the  unique  nature  of  SAR  imagery;  the 
metrics  quantify  two  types  of  noise  common  In  predictive  systems  such 
as  ADPCMs  slope  overload  and  large  quantizer  step  size  noise. 
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As  compared  to  simple  algorithms  such  as  the  BTC  or  simple  spatial 
VQ,  the  OCT  Is  computationally  Intensive,  but  can  result  In  compres¬ 
sion  rates  below  2  bits/pixel  with  little  visual  degradation.  The 
heart  of  the  scheme  Is  a  systematic  apportionment  of  quantization  bits 
across  the  set  of  transform  coefficients,  with  those  coefficients  In 
the  "upper  left"  of  the  coefficient  matrix,  corresponding  to  those 
components  closer  to  zero  spatial  frequency,  being  assigned  the 
greater  number  of  bits.  The  highest  spatial  frequency  components  are 
often  discarded  or  assigned  only  one  bit.  Each  coefficient  Is  then 
quantized  according  to  the  number  of  assigned  bits,  with  quantization 
lookup  tables  available  for  each  bit-assignment  case.  The  quantizer 
tables  must  be  optimized  so  as  to  minimize  the  quantization  error, 
thus  taking  Into  account  the  probability  distribution  of  the  transform 
coefficients.  In  order  to  achieve  consistent  performance,  It  then 
becomes  necessary  to  engage  In  normalization  of  the  Input  blocks  prior 
to  performlng^the  20  transforms  from  the  spatial  domain.  It  Is  typi¬ 
cal  to  do  the  transforms  using  blocks  on  the  order  of  16x16.  In  addi¬ 
tion  to  the  computation  Intensity  of  the  two  dimensional  cosine  trans¬ 
form,  the  normalization  calculations  and  selection  of  bit  assignment 
rules  add  significantly  to  the  complexity  of  DCT  types  of  transforms. 
Chen  and  Smith  [26]  describe  a  practical  OCT  Implementation.  Re¬ 
cently,  a  two  dimensional  Fast  Cosine  Transform  (FCT)  has  been  Intro¬ 
duced  by  Lee  [27]  as  a  means  of  Improving  on  the  computational  effi¬ 
ciency  of  the  DCT,  with  refinements  described  by  Haque  [28]. 

2.4.2  Block  Truncation  Coding  (BTC) 

A  recently  developed  spatial  domain  compression  technique  is  Block 
Truncation  Coding  (BTC).  References  [29] -[39]  describe  BTC  and  Its 
application  to  visual-domain  Imagery.  The  application  to  Seasat  SAR 
Imagery  Is  described  by  Frost  in  [39].  Figure  2-1  Illustrates  the  BTC 
procedure.  The  BTC  algorithm  uses  a  two-level  (one-bit)  non- 
parametrlc  quantizer  which  adapts  to  preserve  the  first  two  local 
moments  of  the  Image.  Non-overlapping  blocks  (typically  4  x  4  or 
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FIGURE  2-1.  BTC  TECHNIQUE 


Suppose  a  4  x  4  Image  block  consists  of  the  following  pixel 
amplitudes: 
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199 
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249 
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170 

30 
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249 

The  block  mean  and  standard  deviation  are 

thus: 

x  •  mean  a  97.63 
<r  ■  std.  dev.  ■  93.79 


with 

k  ■  lt»  ■  number  of  elements  In  the  block 


Using  the  block  mean  as  the  threshold  value, 

q  ■  7  ■  the  number  of  pixels  above  the  threshold 


yields:  a  ■  14  (to  the  nearest  Integer) 
b  -  203 


for  the  "low"  and  "high"  values  respectively. 


The  bit  plane  Is  then 

110  0 
0  111 
0  0  0  1 
0  0  0  1 


resulting  In  a  reconstructed  block  of 

203  203  14  14 

14  203  203  203 

14  14  14  203 

14  14  14  203 

which  preserves  the  block  mean  and  standard  deviation. 
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8x8  pixels)  are  replaced  with  pixels  quantized  to  one  of  two  levels 
so  that  the  mean  and  variance  of  the  block  are  maintained.  The  two 
levels  for  each  block  represent  local  "bright"  and  "dark".  A  bit 
plane  of  one's  and  zero's  Is  transmitted  representing  each  pixel  as 
either  high  (bright)  or  low  (dark),  corresponding  to  pixels  above  or 
below  the  block  means.  Also  transmitted  are  quantized  values  of  the 
block  mean  (corresponding  to  the  average  pixel  brightness)  and  block 
standard  deviation  (corresponding  to  the  pixel  contrast  within  the 
block).  The  receiver  or  decoder  reconstructs  an  approximation  to  the 
original  block  by  replacing  each  pixel  having  a  "high"  value  repre¬ 
sented  by  a  "one"  In  the  bit  plane  by  a  value  equal  to 

b  ■  x  +  9\  (k  -  q)/q 

and  points  repressed  by  a  "zero"  In  the  bit  plane  by 

a  ■  x  -  dq/(k  -  q) 

where  x  ■  block  mean 

a  -  block  standard  deviation 

k  ■  number  of  points  In  block 

q  ■  number  of  points  above  the  block  mean 

Thus  each  point  above  the  threshold  (the  original  block  mean)  Is  re¬ 
placed  by  value  equal  to  "b",  and  those  below  the  mean  by  a  value 
equal  to  "a".  The  process  Is  Illustrated  In  Figure  2-1.  The  threshold 
Is  commonly  chosen  a  priori  to  be  the  block  mean,  although  other  more 
complex  calculated  thresholds  have  been  proposed  In  order  to  preserve 
higher-order  moments.  As  seen  In  Figure  2-1,  the  compressed  block 
consists  of  pixels  having  a  value  above  the  original  block  mean  (the 
204-values)  and  pixels  below  the  mean  (the  17-values).  The  block  mean 
and  variance  of  this  new  block  are  Identical  to  those  of  the  original 
block.  The  transformation  Is  repeated  for  every  (non-overlapping) 
block  over  the  Image,  with  new  high  and  low  values  calculated  for  each 
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block.  The  average  value  (or  average  brightness)  and  variance  (corre¬ 
sponding  to  the  contrast  between  pixels)  are  thus  adaptive  over  the 
Image,  adjusting  to  fit  the  Image  at  each  4x4  area,  for  example. 
The  required  number  of  bits  to  be  transmitted  Is  simply 

m2  +  N(mean)  +  NSTDEV , 

where  m2  >  block  size  (4  x  4  ■  18  for  a  square  4x4  block)  and  NMean, 
NSTDEV  are  the  number  of  bits  used  to  encode  the  block  mean  and  block 
standard  deviation.  If  the  mean  and  standard  deviation  were  each 

represented  by  8  bits,  then  a  4  x  4  block  would  require  16  bits  for 

the  1-blt  mask  and  2  x  8  •  16  bits  for  the  mean  and  standard  devia¬ 
tion,  giving  a  total  of  32  bits  for  16  pixels,  or  2.0  bits  per  pixel. 

If  8  x  8  blocks  were  similarly  encoded,  the  result  would  be  80  bits 
for  64  pixels,  or  1.2S  bits  per  pixel.  The  performance  Is  seen  to  be 
asymptotic  to  1  bit  per  pixel  with  Increasing  block  size.  The  general 
formula  for  the  number  of  bits  per  pixel  Is  slmplyi 

bpp  ■  1  +  (NM  +  NV) / (Block  Size) 
where  bpp  ■  bits/pixel 

NM  ■  Number  of  bits  used  to  encode  the  block  mean 
NV  -  Number  of  bits  used  to  encode  the  block  standard 
deviation. 
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3.0  BTC  RESULTS 


Tht  result!  of  Block  Truncation  Coding  of  SAR  Imagery  are  shown  In 
Figures  3-1  through  3-4.  These  figures  were  obtained  photographically 
using  a  OeAnza  Image  processing  softcopy  (CRT)  display  having  512  x 
512  display  resolution  with  256  levels  of  Intensity  displayed.  For 
this  Investigations  original  STAR-1  SAR  Image  of  Belle  Isle  and 
Belle  Isle  Bridge  In  the  Detroit  River  was  used  (Figure  3-1).  The 
digital  image,  having  15  bits/pixel,  Is  first  mapped  to  8  bits/pixel 
using  a  1  In/log  dynamic  range  adjustment  (DRA)  as  is  typical  for  SAR 
data  display.  No  transmission  error  effects  are  Included.  The  mean 
and  standard  deviation  parameters  are  quantized  at  8  bits  (per  block) 
each.  The  results  of  the  4  x  4  BTC  (2.0  bits  per  pixel),  8  x  8  BTC 
(1.25  bits/pixel)  and  16  x  16  BTC  (1.06  bits/pixel)  are  presented  for 
the  same  SAR  scene  In  Figures  3-2  through  3-4  respectively.  The  BTC4 
results  are  clearly  superior  to  those  of  BTC8  and  BTC16;  the  marginal 
Improvement  In  compression  available  by  Increasing  block  size  Is  small 
as  BTC  asymptotically  approaches  1  bit  per  pixel.  As  the  BTC  block 
size  Increases,  the  underlying  blocks  can  be  seen  In  the  Image,  par¬ 
ticularly  for  the  16  x  16  BTC.  However,  flicker  comparisons  between 
BTC4  and  the  original  Images  reveal  almost  no  differences:  some  bright 
pixels  are  reduced  slightly  In  amplitude,  while  some  darker  pixels 
Increase  slightly,  but  shapes  and  edges  are  preserved  quite  well.  A  4 
x  4  block  size  Is  so  small  that  It  Is  virtually  unnotlceable  to  an 
observer.  Thus  we  conclude  that  SAR  data  compression  up  to  2  bits/ 
pixel  can  be  achieved  using  BTC  with  4x4  blocks  without  significant 
loss  of  Image  quality, 

BTC  has  sometimes  been  considered  as  a  special  case  of  Vector 
Quantizing,  discussed  below.  One  significant  difference  lies  In  the 
non-parametrlc  or  adaptive  nature  of  BTC;  the  algorithm  adjusts  to  the 
statistics  of  each  Individual  scene  on  a  block  by  block  basis. 
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FIGURE  3-1.  FULL  RESOLUTION  SAR  IMAGE 


15 


FIGURE  3*2.  4  X  4  BTC  IMAGE 
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87*1 1357 


87*10489*7 


FIGURE  3-3  8X8  BTC  IMAGE 
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17-1 0488*10 

FIGURE  3-4,  16  X 16  BTC  IMAGE 
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3.1  COMPARISON  OF  BTC  WITH  DPCM 

In  comparison  with  adaptive  DPCM  (ADPCM) ,  probably  tha  most  preva- 
lant  example  of  a  spatial -domain  comprasslon  technique,  a  numbar  of 
aspacts  of  BTC  ara  notaworthy.  BTC  Is  Inharantly  block  adaptive  In 
that  all  tha  coding  paramatars  hava  tha  ability  to  changa  at  avary 
block.  DPCM  Is  computationally  mora  complax  than  BTCi  a  thraa-stap 
ADPCM  pradlctor  basket  cn  pixels  to  tha  left,  above,  and  at  a  45°  angle 
would  require  three  multiplications  for  avary  pixel  for  tha  prediction 
process  alone.  Tha  ADPCM  coder  loop  Is  mora  complax  than  tha  BTC 
coding  process,  requiring  an  AOPCM  decoder  In  tha  pradlctor  feedback 
loop  as  tha  cost  required  to  achieve  somewhat  higher  expected  quality 
than  with  BTC. 

Channel  errors  are  observed  In  the  reconstructed  Image  differently 
In  BTC  and  DPCM.  In  DPCM  the  channel  errors  appear  as  streaks  In  the 
Image.  If  the  DPCM  predictor  Is  not  adaptive  or  new  Initial  condi¬ 
tions  ar4  not  sent  after  a  few  rows  of  the  Image,  then  these  streaks 
will  propagate  throughout  the  entire  Image..  The  amount  of  propagation 
Is  determined  by  the  type  of  predictor  used.  If  a  Moving  Average 
(MA)  predictor  Is  used,  there  will  be  no  streaking.  For  BTC  the 
errors  appear  as  bad  blocks  of  data  (small  and  localized  In  the  case 
of  4  x  4  pixel  blocks). 

Reference  [31]  describes  a  DPCM  algorithm  capable  of  compressing 
photographic  Images  from  11  to  3  bits/pixel  with  Indiscernible  (sub¬ 
jective)  visual  degradation.  A  conservative  BTC  approach  (4  x  4,  with 
11  bits  per  Input  pixel  and  11  bit  mean  and  standard  deviation)  would 
achieve  2.375  bits/pixel,  by  comparison.  An  Interesting  combination 
of  DPCM  and  BTC  has  been  suggested  In  Reference  [32],  In  which  the 
DPCM  difference  signal  Is  quantized  to  one  bit  using  BTC.  A  photo¬ 
graphic  Image  was  quantized  to  1.18  bits/pixel  using  this  hybrid  tech¬ 
nique,  compared  to  a  1.63  bits/pixel  rate  for  BTC  alone  for  the  same 
Image. 
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3.2  VECTOR  QUANTIZING 

Victor  Quantizing  (VQ)  as  first  applied  to  temporal  signals  Is 
described  by  Gray  [40],  and  by  Garsho  [41]  as  applied  to  Imagery.  An 
appreciation  for  the  possibilities  Inherent  In  VQ  can  be  gained  by 
recalling  a  classical  result  from  communication  theory,  namely  that 
the  error  perfermance  achieved  In  encoding  a  sequence  of  data  samples 
may  he  decreased  to  an  arbitrarily  small  level  by  coding  the  data  In 
Increasingly  large  blocks  of  samples.  This  result  Is  known  as  the 
Block  Orthogonal  Coding  case.  The  application  of  this  principle  to 
data  compression  can  be  appreciated  by  considering  that  the  same  tech¬ 
nique  of  block  data  encoding  could  be  applied,  not  to  reduce  error 
with  Increasing  block  length,  but  holding  the  error  rate  constant  with 
increasing  block  length,  and  compensating  for  the  potentially  Improved 
error  performance  by  allowing  quantizing  noise  to  Increase  by  coarser 
quantizing.  The  net  would  be  to  encode  with  fewer  bits,  thus  achiev¬ 
ing  compression.  In  general  then,  encoding  the  samples  In  blocks  or 
vectors  rather  than  Individually  (as  scaler  amplitudes)  can  be  used 
either  to  achieve  compression  or  to  reduce  error  rate.  Reducing  the 
average  number  of  bits  per  encoded  sample  while  encoding  blocks  Is 
called  Vector  Quantization. 

Figure  3-5  Illustrates  the  basic  principle  of  VQ  with  a  very  sim¬ 
ple  one-dimensional  case  encoding  signal  samples  by  pairs.  Figure  3-6 
extends  the  technique  to  the  two-dimensional  Image  case.  The  key  step 
of  the  process  Is  the  selection  of  the  optimum  mapping  from  an  Input 
sample  block  or  vector  to  one  of  a  limited  set  of  representative  code¬ 
vectors.  The  set  of  possible  codevectors  must  be  chosen  based  on  the 
probability  densities  of  the  Input  blocks  so  as  to  minimize  the  over¬ 
all  average  error  or  Image  distortion.  The  most  popular  means  of 
determining  this  codebook  mapping  Is  by  employing  an  optimization 
algorithm  known  as  the  LBG  algorithm,  In  reference  to  the  three  au¬ 
thors  (Unde,  Buzo,  and  Gray,  [42])  who  developed  It.  A  codebook 
produced  by  such  a  process  for  Imagery  data  Is  represented  by  a  set  of 
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Typical  Spatial  Signal  or  Image  Samples 
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FIGURE  3-6.  TWO-DIMENSIONAL  VECTOR  QUANTIZATION 


representative  pixel  patches  as  shown  In  Figure  3-7.  In  this  example 
case,  the  64  representative  pixel  blocks  are  arrayed  In  an  8  x  8  ma¬ 
trix  (the  matrix  presentation  Is  arbitrary,  only  the  number  of  unique 
blocks  Is  significant).  Any  3  x  3  (in  this  case)  Input  data  block  Is 
replaced  by  the  most  appropriate  representative  block  from  the  code¬ 
book  matrix.  Thus  as  one  would  expect,  the  codebook  consists  of 
blocks  or  Image  patches  having  various  Intensities  and  gradients,  with 
some  of  the  blocks  having  typical  edge  patterns. 

Compression  results  of  0.66  bit/pixel  can  be  achieved  using  3x3 
Input  blocks  with  a  library  of  64  codevector  blocks.  Such  VQ  tech¬ 
niques  are  found  to  encode  visual  imagery  with  very  little  perceptual 
degradation  except  at  sharp  edges,  where  a  staircase  effect  Is  found. 
This  latter  distortion  has  been  attacked  by  using  a  Classified  VQ 
(CVQ)  to  first  classify  the  block  as  to  type  In  order  to  separate  out 
the  blocks  having  edges  and  encode  them  from  a  special  code  table  for 
edge  blocks.  Ramamurthl  and  Gersho  describe  CVQ  In  [43].  A  number  of 
further  extensions  to  VQ  are  given  by  Nasrabadl  In  [44],  Including 
predictor  techniques  and  vector  coding  of  transform  coefficients. 
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4.0  RECOMMENDATIONS 

No  results  of  the  application  of  VQ  to  SAR  Imagery  are  available 
at  present.  Considering  the  nature  of  SAR  Imagery  as  described  ear¬ 
lier,  It  may  be  possible  suggest  areas  for  Investigation  as  to  the 
best  means  to  employ  VQ  with  SAR  Imagery: 

•  To  what  degree  does  the  edge  fidelity  criterion  apply  to  SAR 
Imagery. 

•  The  effect  that  Speckle-reduction  algorithms  would  have  on 
simplifying  codebook  selection  operations. 

•  The  Inherent  degradation  of  target  patterns  which  would  be 
Incurred  using  VQ  (Jagged  edges  for  example)  may  affect  Image 
utility  different  for  human  Image  Analysts  (IAs)  than  It  would 
affect  current  generation  Automatic  Target  Recognition  (ATR) 
algorithms;  the  relative  effects  of  such  degradations  should 
be  Investigated  by  a  comparative  experimental  effort. 

It  Is  thus  recommended  that: 

•  A  codebook  optimization  algorithm  be  Implemented  (such  as  the 
LBG  algorithm)  and  run  against  representative  SAR  Imagery 
containing  tactical  targets  of  Interest. 

•  In  house  trained  IA's  be  employed  to  assess  the  degradation  of 
target  signatures  for  simple  VQ  (as  opposed  to  Classified  VQ, 
or  CVQ) .  This  assessment  would  Include  a  recommendation  as  to 
whether  a  more  complex  multistage  VQ  approach  such  as  CVQ  be 
employed  to  retain  edge  fidelity. 

•  The  utility  of  speckle  reduction  for  pre-VQ  processing  or 
post-VQ  edge  smoothing  should  be  Investigated. 

•  A  modest  performance  single  stage  VQ  having  small  blocks 
(3x3  recommended)  be  used  for  baseline,  with  codebook  sizes 
of  54  and  256  be  employed. 
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